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Speci&cation 
[l. Title of the Invention] 
Instruction Pre-fetch Control System 

[2.Claims] 

[Claim l] An instruction pre-fetch system comprising! 
a storage means to store a pre-fetched instruction, 

a detection means to detect instruction word length of the pre-fetched 
instruction before the pre-fetched instruction is stored in said storage means, 
a decode means to detect the pre-fetched instruction is at least a branch 
instruction before the pre-fetched instruction is stored in said storage means, 
a control means to request abortion of instruction pre-fetching as soon as all 
bytes composing said branch instruction are stored in said storage means, 
according to control information from said detection means and said decode 
means. 

[3. Detailed description of the Invention] 

The present invention* is 'related to a computer, 'especially related to a 
pre-fetch control system of an instruction. 

[Prior Arts] 

One of methods for increasing speed of a computer is an instruction pre-fetch 
control system. One of characteristics of a von Neumann computer is a high 
probabiUty that an instruction to be executed in the next is an instruction 
follows an instruction currently being executed. The instruction pre-fetch 
system utilizes the characteristic to decrease a time reqviires for instruction 
fetching with performing an instruction fetching to a sequential file during a 



spare time of an external bus, and storing in an instruction buffer placed in 
the computer beforehand. 

Fig.4 is a block diagram of a CPU having a traditional pre-fetch mechanism. 
CPU2 exchanges data with an external bus 1. When a fetch request signal 
Une 119 is in high-level, a bus control part 113 whose input is the fetch 
request signal Une 119 issues an instruction fetch request to the external 
memory. When the fetch request is issued, the bus control part 113 outputs 
an address of an instruction to be fetched on an address data bus 115, as a 
result, an instruction stored in said address of the external memory is input 
to the bus control part 113 through the address data bus 115. Furthermore, 
an internal instruction buffer 102 stored in the internal instruction buffer 
102 through an internal data bus 101 has a FIFO (First In First Out) 
structure and entered in a decoder 106 through an instruction bus 105 in 
order of fetched instructions. A queue write counter 104 indicates a position 
of the internal instruction buffer 102 to which an instruction fetched from 
the external memory is written, and a queue read counter 103 indicates a 
position from which an instruction stored in the internal instruction buffer 
102 is read out to a decoder 106. That is, a difference between the queue 
write counter 10 and the queue read counter 103 is a total number of bytes 
currently stored in the internal instruction buffer 102. An output of the 
decoder 106 notifies a start address of a micro program 108 through a micro 
instruction address path 107. By execution of the micro program, a group of 
control signals 109 is output to an execution unit 110, and in the execution 
unit 110, read/write operation of data with the external memory through the 
bus control part 113 according to the group of control signals 109. Fetching of 
an instruction is performed based on the fetch request from the internal 
instruction buffer 102-in the spare time of the external bus 1 which does not 
have data read/write. However, if data read/write request is generated fi:om 
the execution unit 110 during the external bus 1 is performing inistruction 
fetching, the read/write request is kept waiting until the instruction fetching 
currently performed is completed. 

The fetch request is issued when it is determined that there is a vacant 
buffer in the internal instruction buffer 102 by the queue control part 116 
whose inputs are the queue read counter -103 and the queue write counter 
104, then a fetch request signal Une 117 becomes to be a high-level. However, 
when there is no fetch request, the fetch request signal line 117 is in 



low-level. Though the instruction fetching is effective under the assumption 
that most of instructions are performed sequentially; when there is a branch 
instruction which disrupts a flow of program such as a jump, a call or a 
return instruction, all instructions after the branch instruction stored in the 
internal instruction buffer 102 have to be deleted, and instructions at branch 
destination have to be fetched again. As a result, deleted instruction fetch is 
unproductive. When the unproductive iastruction fetch is performed in a 
perfect spare time of the external bus 1 in which there is no memory access 
such as data read/write, there is no problem. However, if a memory access 
request arises while the improductive instruction fetch is performed, the 
memory access request is kept waiting until the unproductive iastruction 
fetch cyde-is completed, then the CPU 2 spent wasteful time for the 
unproductive fetch. As a result, the wasteftd time causes delay of the 
execution time of the computer as a whole. Consequendy, as one of 
traditional instruction pre-fetch controls, there is a method to abort 
instruction fetching after decoding all instruction bytes composing a branch 
instruction with the decoder 105 for reducing instruction fetch which will be 
unproductive after a branch instruction. 

An instruction pre-fetch control system above mentioned is a system 
described in IEEE Spectrum, 1979, March, P28 - 34. 

The method above mentioned has a defect that all bytes composing a branch 
instruction are perfectiy decoded, unproductive iastruction fetching is 
performed until an abort request is issued, then execution time of a computer 
is delayed. 

The purpose of the present invention is to provide an instruction pre-fetch 
control system in which increasing of processing speed of a computer with 
minimizing unproductive f(etch" with^a^ stage 
by detecting a branch iastruction earlier. 

[Means to Solve the Problem] 

In an iastruction pre-fetch system of an information processing device, an 
instruction pre-fetch system according to the present invention comprising; 
a storage means to store a pre-fetched instruction, a detection means to 
detect iastruction word length of the pre-fetched instruction* before the 
pre-fetched instruction is stored in said storage means, a decode means to 
detect the pre-fetched instruction is at least a branch instruction before the 



pre-fetched instruction is stored in said storage means, a control means to 
request abortion of instruction pre-fetching as soon as all bytes composing 
said branch instruction are stored in said storage means, according to control 
information from said detection means and said decode means. 

[Embodiments] 

Next, an embodiment of the present invention is explained with drawings. 
Fig. 1 is a block diagram of one embodiment of the present invention. 
This embodiment comprises; an internal instruction buffer 202 as a storage 
means to store a pre-fetched instruction, a pre-decoder 201 as a detection 
means to detect instruction word length of the pre-fetched instruction before 
the pre-fetched instruction is stored in the internal instruction buffer 202, a 
decoder included in the pre-decoder 201 as a decode means to detect the 
pre-fetched instruction is at least a branch instruction before the pre-fetched 
instruction is stored in the internal instruction buffer 202, a two inputs AND 
circuit 218 as a means to request abortion of pre-fetching as soon as all bytes 
composing the ranch instruction are stored in the internal instruction buffer 
202 according to a control information from the pre-decoder 201. 
Next, an operation of this embodiment is explained. 

CPU 4 exchanges data with the external data bus 3, and when a fetch 
request signal 219 is in the high-level, the bus control part 213 issues an 
instruction fetch request to the external memory. When the fetch request is 
issued, the bus control part 213 outputs an address of an instruction to be 
fetched to the external bus 3, as a result, through the address data bus 215 
again, the instruction to be fetched in the external memory from the external 
bus 3 to the bus control part 213 through the address data bus 215. 
Furthermore, the instruction input in the bus control' part 213 is stored in 
the internal instruction buffer 202 through the internal data bus 201. The 
internal instruction buffer 202 has a FIFO structure, and instructions are 
sent to the decoder 206 through the instruction bus in order of being fetched. 
The queue write counter 204 points a position of the internal instruction 
buffer 202 to whidi the instruction fetched from the external memory is 
written, the queue read counter 203 points a position from whidi the 
instruction stored in the internal instruction buffer- 202 is-read out to the 
decoder 206. That is, the difference between the queue write counter 204 and 
the queue read counter 203 is a total number of bytes stored in the internal 



instruction buffer 202. Moreover, an output of the decoder 205 notifies a start 
address of a micro program 208 through a micro instruction address bus 207, 
a group of control signals 209 is output to an execution imit 2 10 by execution 
of the micro program. The execution unit 210 performs read/write operation 
with the external memory through the bus control part 213 according to the 
group of control signals 209. Fetching of an instruction is performed 
according to a fetch request signal 217 from the queue control part 216 
during a spare time of the external bus 3 where there is no read/write cycle of 
data. The fetch request is issued when it is determined that there is a vacant 
buffer in the internal instruction buffer 202 by the queue control part 316 
whose inputs are the queue write counter 204 and the queue read counter 
203. Then -a-*fetGh «request signal 217 from the queue- control part 216 
becomes to be high-level, and a fetch request from the internal instruction 
buffer 202 is generated. Here, when there is no fetch request, the fetch 
request signal 217 is in low-level. As mentioned in an example of the prior 
arts, the instruction fetch is effective when it is assumed that most of 
instructions are executed sequentially. However, if there is an instruction 
which disrupts a flow of program such as a jump, ^ a call or a return 
instruction, all instructions after the branch instruction stored in the 
internal instruction buffer 202 have to be deleted, and instructions at branch 
destination have to be fetched again. Therefore, in this embodiment, a 
pre- decoder 220 is placed in a step prior to the internal instruction bvtffer, 
the pre-decoder 220 requests abortion of fetching when it is detected that the 
fetched instruction is a branch instruction and all instruction bytes 
composing the branch instruction are stored in the internal instruction 
buffer 202, with making a fetch abortion signal hne 221 in low level. 
Wherein, the fetch abortion 'signal lLQ~e*22 1 is usuaUy When the 

fetch abortion signal line 221 is in low level, an output signal Hne 219 of a 
two- inputs AND circuit 2 18 is in low-level to notify abortion of instruction 
fetching to the bus control part 213. On the other hand, when the fetch 
abortion signal line 221 is in high-level, there is a vacant buffer in the 
internal instruction buffer 202, and the fetch request signal Hne 217 from the 
queue control part 216 is in high-level (fetch request), an output Hne 219 of 
the two-inputs AND circuit 2-15 is in high-level to request fetching to the bus 
control part 213. 

The present invention provides a means of minimizing unproductive fetching 



with detecting a branch instruction earlier, as soon as it is detected that all 
instruction bjrtes composing the branch instruction are stored in the internal 
instruction buffer 202, abortion of fetching is requested to the bus control 
part 213 by placing the pre-decoder 220 in a step prior to the internal 
instruction buffer 202. 

Fig.2 is a detailed circuit diagram of the pre-decoder shown in Fig.l. 
A decoder 305 detects instruction word length currently fetched from data 
input from a signal line 303, and outputs to a counter 309 through a signal 
line 307. A decoder 306 detects whether a currently fetched instruction is a 
branch instruction, when the instruction is a branch instruction, and makes 
the control signal line 308 in high-level. A counter 309 inputs the instruction 
word length o£-a currently fetched instruction, when a control signal 223 
from the bus control part 213 is in high-level, performs decrement operation 
once to decrement stored contents and outputs contents after decrement to a 
latch circuit 310 simultaneously. Wherein, the counter 309 makes the control 
signal 311 in high-level when its value is "0", and the latch circuit makes the 
control signal 312 in high-level when its value is "0". Furthermore, each 
value of the counter 309 and the latch circuit 310 is set to "0'' in the initial 
state (at resetting). A control signal line 222 from the bus control part 213 is 
a control signal in high-level when a currently executed bus cycle is a fetch 
cycle, a control signal line 223 is a control signal which becomes in high-level 
1 clock delayed compared to the control signal line 222. Inputs of the 
two-inputs AND circuit 313 are the control signal line 222 and the control 
signal hne 312, when the control signal line 222 is in high-level, that is in a 
fetch cycle, and the control signal line 312 is in high-level, that is a value of 
the latch circuit 310 is "0", the control signal line is made to be in high-level 
to make both of the gate^301 and the gate 302 to be ON. — — 
The gates 301 and 302 output a value on the internal data bus 201 to the 
data bus 303 and the data bus 304 respectively when they are ON states. 
Inputs of the two-inputs AND circuit 315 are the control signal line 311 and 
the control signal Hne 305, the control signal line 316 is made to be high-level 
when the control signal line 311 is in high-level, that is a value of the counter 
is "0", and the control signal line 308 is in high-level, that is the currently 
fetched instruction is a branch instruction. The latch- circviit--3-l-7 -makes the 
control signal line 221 in low-level when the control signal hne 316 is in 
high-level, and outputs low-level until the control signal hne 318 becomes to 



be low-level. The control signal line 318 becomes to be low-level according to 
a fetch re-start request when a branch destination address is determined. 
When the control signal line 318 becomes to be low-level, the latch circxiit 
317 makes the control signal line 221 in high-level, and the control signal 
hne 221 outputs high-level until the control signal line 316 becomes in 
high-level again. 

Next, an operation of pre-decoder 220 shown in Fig.2 is explained with a 
timin g chart shown in Fig.3. Wherein, it is assumed that a first fetched 
instruction after resetting is a 3 -bytes instruction A which is not a branch 
instruction and a second fetched instruction is a 3 -bytes instruction B. 
Both the counter 309 and the latch circuit 310 just after the resetting are "0", 
as the latch circuit 310.is^"0", the control signal hue 312 is in high-level. 
Next, when a fetch cycle of a first bjrte of the first 3-bytes instruction A is 
started, the control signal line 222 becomes to be high-level, while the signal 
line 222 is in high-level, the output signal line 314 of the two-inputs AND 
circuit 313 is in high-level. When the control signal line 314 is in high-level, 
the gate 301 and the gate 302 become to be ON, a first byte on the internal 
data bus 201 is sent to the decoders 305 and 306 through the data bus 304. 
The decoder 305 decodes an instruction word length (=3) of the instruction A 
from the input first byte, and the instruction word length (=3) is output to 
the counter 309 through the signal hne 307. On the other hand, the decoder 
306 decides whether the instruction A is a branch instruction firom the input 
fiirst byte. In this case, as the instruction is not a branch instruction, the 
control signal hne 308 outputs low level. The counter 309 holds the 
instruction word length (=3) input through the signal hne 307, decrements 
the held value and outputs the decremented value (=2) to the latch circuit 
310 when the control signal"^hne 223'from the bus control part 213 becomes 
to be high-level. At this time, as the value of the latch circuit 310 becomes to 
be "2**, the control signal line 312 becomes to be low-level, after that as the 
output signal hne 314 of the two-inputs AND circuit 313 keeps low-level and 
the gate 301 and the gate 302 keep OFF even if the control signal line 222 
becomes to be high-level, data on the internal data bus 201 is not input to the 
data buses 303 and 304. Though the counter 309 keeps decremented value 
(=2), as the control signal line 223 becomes to be-high-level-at each fetch 
cycle, each time keeping value is decremented and the decremented value is 
returned to the latch circuit 310. 



As shown in Fig.3, when the control signal 223 becomes to be high-level at a 
fetch cycle of a third byte of the instruction A, the counter 309 decrements 
holding value (=l) and outputs the decremented value (=0) to the latch 
circuit 310. As the content of the latch circuit 310 becomes "0" again, the 
control signal line 312 becomes to be high-level, when the control signal line 
222 becomes to be high-level at a fetch cycle of the first byte of the 
instruction B, the output signal hne 314 of the two-inputs AND circuit 313 
becomes to be high-level, the gates 301 and 302 are turned to ON, a first byte 
of the instruction B on the internal data bus 201 is input to the decoders 305 
and 306. The decoder 305 decodes the instruction word length (=2) of the 
instruction B from the input first byte, and outputs the instruction word 
length (=2) to the.GOunter.309 through the signal hne 307. On the other hand, 
the decoder 306 determines whether the instruction B is a branch instruction 
firom the input first byte, when the instruction is assumed to be a branch 
instruction, in this case, the control signal line 308 outputs high-level. The 
counter 309 holds the instruction word length (=2) of the instruction B, 
decrements holding value and outputs decremented value (=1) to the latch 
circuit 310 when the control signal line 223 becomes to be high-level. 
Next, when the control signal 223 becomes to be high-level at a fetch cycle of 
a second byte of the instruction B, the counter 309 decrements holding value 
(=1) and outputs the decremented value (=0) to the latch circuit 310. As the 
value of the latch circuit 310 becomes to be "0", the control signal hne 311 
becomes to be high-level. On the other hand, as the control signal hne 308 of 
the decoder 306 is high-level, the output signal line 316 of the two-inputs 
AND circuit 315 becomes to be high-level too. When the control signal hne 
316 becomes to be high-level, the latch circuit 317 makes the control signal 
hne- 221 in low-level. The control signal hne 221 keeps output low-level 
until the control signal Hne 318 becomes to be low level. As the control signal 
line 221 is in low-level, the fetch request signal line 217 is in high-level, as 
the control signal line 221 is in low-level even if the fetch request is 
generated, an output signal hne 219 of the two-inputs AND circuit 216 keeps 
in low-level, a fetch request to the bus control 2 13 is not performed. The fetch 
request is not accepted until the branch destination address is decided, and 
the control signal line becomes-to be low-level and the-eontrol-signal hne 221 
becomes to be high-level. 

As shown in an embodiment above mentioned, minimizing of unproductive 



fetching is allowed by placing a pre-decoder before fetching an instruction in 
the processor from an external bus, detecting an instruction currently 
fetched is a branch instruction and the branch instruction and all byte 
composing the instruction are fetched in the processor so that an abortion 
request of fetching is output in earUer stage. 

[EjQfect of the Invention] 

As mentioned above, in the present invention, a pre-decoder is placed in a 
stage prior to fetch an instruction from an external data bus into an internal 
instruction buffer, a first byte of the instruction to be fetched is decoded, 
when the decoded instruction is a branch instruction, if it is detected that all 
bytes composing«thB branch instruction are fetched, an abortion request of 
fetching is issued immediately to abort instruction fetching to minimize 
instruction fetching which becomes unproductive by execution of the ranch 
instruction, then data access waiting time is decreased, improvement of 
computer processing speed can be obtained. 

[4.Brief description of Drawings] 

Fig. 1 is a block diagram of one embodiment of the present invention. 
Fig.2 is a detailed circuit diagram of a pre-decoder shown in Fig.l. 
Fig.3 is a timing chart at operation of the pre-decoder shown in Fig.2. 
Fig.4 is a block diagram of a CPU having traditional pre-fetch mechanism. 
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